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Vectors, Cells and Processes for 




Pyrimidme Deoxvribonucleosides Production 



Field of the Invention 

5 The present invention relates to the production of pyrlmldines, purines and derivatives 
thereof e.g. deoxyribonucleosides, using genetically modified cells comprising novel 
DNA constructs. 

Background of the invention 

10 

Thymidine is useful as a pharmaceutical intermediate, particularly for the chemical 
synthesis of azidothymidine ("AZT," sold under the trademark ZIDOVUDINE). 
Although ZIDOVUDINE-type AZT was one of the first therapies developed for 
HIV/AIDS, it continues to have important and expanded use (Langreth, R., The Wall 

15 Sfreef Journal, Nov. 21 , 1 995, pp B1 2). ZIDOVUDINE-type AZT is valuable particularly 
when used in combination therapies such as a combination with lamivudine (also 
known as 3TC), sold under the trademark EPIVIR. This lamvudine and 3TC 
combination is sold under the trademark COMBIVIR. Although the HIV virus can 
mutate to form resistance to either AZT or 3TC, COMBIVIR-type nucleotide-analog 

20 combination is particularly effective because the reverse transcriptase apparently 
cannot be resistant to both nucleoside analogues at the same time (Larder, B.A. etaL, 
Science 269; 696-699, 1995;. ZIDOVUDINE-type AZT is also useful in conjunction with 
HIV protease inhibitor type drugs (Waldholz, M., The Wall Street Journal, Jan. 30, 
1996, pp B1), and in the treatment of HIV infected pregnant women in order to reduce 

25 the frequency of infection of the fetus at birth. In 1 997 about 600,000 children died of 
AIDS contracted from their mothers at birth. ZIDOVUDINE-type AZT taken for several 
months prior to birth can reduce the transmission of the virus to infants by two-thirds. 
Thymidine produced by chemical synthesis used in the manufacture of AZT is a very 
significant cost. 

30 

In U.S. 5,213,972 (McCandliss & Anderson, hereinafter 'the '972 patent"), the entire 
contents of which are incorporated herein by reference and to which the reader is 
specifically referred, a process for the production of pyrimidine deoxyribonucleoside 
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(PdN) is disclosed (see in particular examples 7 to 14 of the '972 patent). A 
repiicatable microorganism comprising and expressing a DNA sequence encoding a 
pyrimidine deoxyribonucleotide phosphohydrolase that converts a PdN monophosphate 
to a pyrimidine deoxyribonucleoside is taught. More particularly, McCandliss & 
5 Anderson, supra, describe a fermentation method that can be used to produce 
thymidine that Involves the expression of deoxythymidylate phosphohydrolase 
(dTMPase) from the Bacillus bacteriophage PBS1 . This type of enzyme has been 
found in nature expressed by bacteriophages that do not contain thymidine in their 
DNA, but instead incorporates compounds like deoxyuridine or hydroxymethyl- 
10 deoxyuridine. 



In the thymidine fermentation described in the '972 patent, the enzymes that degrade 
thymidine (thymidine phosphorylase and uridine phosphory(ase) have been removed by 
mutation so that thymidine accumulates. Thus, the use of the dTMPase enzyme helps 

15 create the pathway to allow thymidine synthesis. An expression of dTMPase alone, 
however, may not assure a commercially viable level of thymidine production. 
Accordingly, there is a continuing need to enhance the production of thymidine by cells 
expressing dTMPase in order to make thymidine production by fermentation 
commercially viable, by lowering the production cost relative to the current chemical 

20 synthesis methods. 



The biochemical pathway for pyrimidine deoxynucleotide production, for example, in E. 
coli is highly regulated at the levels of transcription and translation as well as at the 
protein level by mechanisms including attenuation, feedback inhibition and enzyme 

25 activation. Neuhard, J. and R.A. Kelln. Biosynthesis and Conversion of Pyrimidines, 
Chapter 35 [In] Neidhardt, F.C. etai [eds] '"Escherichia coli and Salmonella Cellular 
and Molecular Biology", Second Edition, Vol. I, pp580-599, ASM Press, Washington 
D.C., 1996. The expression of dTMPase and elimination of thymidine breakdown by 
mutations in the cfeoA (thymidine phosphorylase), udp (uridine phosphorylase) and tdk 

30 (thymidine kinase) genes and therefore resulting expression products results in 
thymidine synthesis in £. coli but not at a commerically viable level. 
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Summary of the Invention 



The biosynthesis of purines and pyrimidines involves a common step of reducing a 
ribonucleoside diphosphate (in some species triphosphate) to its corresponding deoxy 
5 analog. In the overall process the reduction of the ribose moiety to 2-deoxyribose 
requires a pair of hydrogen atoms which are ultimately donated by NADPH and H^ 
However, the immediate electron donor is not NADPH but the reduced form of a heat 
stable protein called thioredoxin or glutaredoxin and at least one other unidentified 
source since the E. coli ribonucleotide reductase system still works in trxA (thioredoxin) 
10 gp( (glutaredoxin) double mutants (Neuhard and Kelln, supra). The reducing 

equivalents of the reduced thioredoxin are transferred to ribonucleoside diphosphate 
reductase which carries out the reduction process. Manipulation of, for example, this 
step could prove useful in improving the commerical production of purine and 
pyrimidine deoxynucleosides. 

15 

It is an object of the present invention to provide novel DNA constructs e.g. vectors and 
genetically modified microorganisms comprising said vectors particularly for use in the 
production of recoverable amounts, especially commercially useful amounts, of 
pyrimidine and purine deoxynucleosides. 

20 

It is also an object of the present invention to provide processes which represent an 
improvement over McCandliss and Anderson described supra. 

In accordance with one aspect of the present invention there is provided a DNA 
25 construct comprising a transcriptional unit which comprises a ribonucleotide reductase 
gene and a thioredoxin gene or a uridine kinase gene and/or a dCTP deaminase gene. 



In one embodiment the DNA construct comprises a transcriptional unit which comprises 
a ribonucleotide reductase gene and a thioredoxin gene. 

In another embodiment the DNA construct comprises a transcriptional unit which 
comprises a uridine kinase gene and/or a dCTP deaminase gene. 
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Preferably the DNA construct comprises a transcriptional unit which comprises a 
uridine kinase gene and a dCTP deaminase gene. 

Most preferably the DNA construct comprises a transcriptional unit which comprises a 
ribonucleotide reductase gene and a thioredoxin gene and a uridine kinase gene and 
a dCTP deaminase gene. 

In accordance with another aspect of the present invention there is provided a modified 
host cell comprising a DNA construct according to the invention. 

In accordance with yet another aspect of the present invention there is provided a 
culture medium comprising the modified host cells of the invention and processes for 
the production of a purine or pyrimidine, for example thymidine ,comprising the use of 
said modified host cells . 



In one embodiment the host cells comprise a DNA construct which construct comprises 
a transcription DNA unit (e.g. operon) which unit comprises DNA sequences encoding 
for ribonucleotide reductase and thioredoxin in which said reductase preferably displays 
less sensitivity to allosteric inhibition than a wild type host cell equivalent or counterpart 
20 wherein said cell further comprises one or more of the following features: 

(a) a transcription unit (e.g. operon), preferably located on said DNA construct, 
comprising DNA sequences encoding for (and preferably heterologous with respect to 
host cell equivalent) thymidylate synthase; 

(b) a transcription unit (e.g. operon), preferably located on said DNA construct, 
25 comprising DNA sequences encoding for uridine kinase and preferably dCTP 

deaminase; and 

(c) repressed or absent Uracil DNA glycosyiase activity. 



In another embodiment the DNA construct for use in the production of recoverable 
30 amounts of pyrimidine and derivatives thereof, in particular pyrimidine 

deoxyribonucleosides such as thymidine, comprises a transcription unit (e.g. operon) 
which unit comprises ( preferably heterologous) DNA sequences encoding for uridine 
kinase and/or dCTP deaminase. 

4 
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Genetically modified host cells comprising and expressing the construct and culture 
medium comprising the modified host cells are also provided. 

5 This aspect is based, in part, on the observation that host cells comprising DNA 

encoding for uridine kinase and/or dCTP deaminase, optionally togather with additional 
genes as suggested In US 5,213,972 required for thymidine production, lead to a 
significant improvement in thymidine production. 

10 The respective aspects of the present invention disclose for the first time a plurality of 
advances on the teaching of U.S 5,213,972 to provide improved DNA constructs and 
host cells comprising the constructs for use in the commercial production of pyrimidine 
deoxyribonucleosides, particularly thymidine. 

Other objects, features and advantages of the present invention will become apparent 
from the following description. It should be understood, however, that these represent 
preferred embodiments of the invention and are by way of illustration only. Various 
modifications and changes within the spirit and scope of the invention will become 
apparent to those skilled in the art. 

Preferred embodiments of the invention 



The construct of the present invention may be chromosomal or more preferably extra- 
chromosomal e.g. located on a vector. 

25 

Vectors of the present invention include plasmid, virus, transposons, minichromosome 
or phage, preferably plasmid. The vector comprising the transcription unit may be 
introduced into the host cell according to any convenient method known to those skilled 
in the art, e.g. PI transduction, electroporation or transformation. Suitable host cells 
30 useful in the present invention include eukaryotes and prokaryotes (e.g. Bacterium). 
Prokaryotes include E. co//, Salmonella, Pseudomonas, Bacillus, strains and mutants 
thereof. E.co// is preferred due to the large amount of information, genetic tools and 
mutant alleles that are available. It is particularly preferred that a method of 

5 
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transduction is available for the host cell of choice to enable mutations to be readily 
moved from one host cell to another and facilitate genetic mutation of the host without 
requiring direct mutation whenever a new mutation is desired. 

5 The present inventors have found that the use of bacteriophage T4 nrdA, nrdB and 
nrdC genes are particularly useful for encoding the reductase and thioredoxin in E. colL 
See Sjoberg, B. M. etaL, EMBOJ., 5:2031-2036 (1986); Tseng, M.-J., etai, J. Biol. 
Chem. 263:16242-16251 (1988); and LeMaster, D.M.. J. ViroL 59:759-760 (1986). 
More specifically, a very significant improvement in E. coli thymidine production was 

10 achieved through the cloning and expression of the T4 bacteriophage nrdA and nrdB 
genes coding for ribonucleotide reductase together with T4 nrdC coding for thioredoxin 
since the T4 ribonucleotide reductase cannot use E. coli thioredoxin. The T4-coded 
ribonucleotide reductase was found to be relatively insensitive to control by allosteric 
inhibition in vitro compared to the E, coli enzyme (Berglund, O., J. Biol. Chem. 

15 247:7276-7281 , 1 972). For example, unlike the E. coli enzyme (Berglund, O., J. Biol. 
Chem. 247: 270-7275, 1972) the T4 ribonucleotide reductase is not inhibited by dATP, 
but actually stimulated by dATP and ATP (Berglund, O., J. Biol. Chem. 247:7276-7281 , 
1972). 



20 DNA sequences encoding for the ribonucleotide reductase ( e.g. nrdA and nrdB genes) 
and thioredoxin (e.g. nrdC gene) are preferably heterologous with respect to host cell 
DNA and preferably derived from T phage (preferably E. co//T bacteriophage), 
particularly T "even" phages e.g. T2, T4 or T6. See Campbell, A.M., Bacteriophages, 
Chapter 123, In Neidherdt, supra; and Mathews, C.K. et al. (eds.) Bacteriaophage T4, 

25 American Society of Microbiology, Washington, D.C., 1983. The term "derived from" is 
intended to define not only a source In the sense of its physical origin but also to define 
material which has structural and/or functional characteristics which correspond to 
material originating from the reference source. 

30 Another useful feature of the T even phage enzyme is its substrate specificity. The 
normal E. coli ribonucleotide reductase uses UDP as a substrate only poorly since the 
K„ for UDP is about 10 fold higher for UDP than CDP (Neuhard and Kelln, supra). 
However, the T4 enzyme has only a two-fold difference in (Berglund, O., J. Biol. 




wo 01/02580 PCT/GBOO/02357 

Chem. 247: 7276-7281, 1972) between CDP and UDP substrates allowing two routes 
to dUTP synthesis. Although there have been attempts to obtain functional expression 
of T4 ribonucleotide reductase in E. coh\ previous efforts were only successful in 
expressing the components separately and could demonstrate activity only by mixing in 
5 vitro (Tseng, M.-J., P. He. J.M. Hilfinger, and G.R. Greenberg, J. Bacterioi 172: 6323- 
6332, 1990). Whilst not being bound by theory, the inventors believe that perhaps due 
to the lack of the usual pattern of feedback inhibition, expression of T4 ribonucleotide 
reductase in E. coti is lethal and it must be carefully conditionally expressed. Further 
envisaged are genes that encode precursor forms of the reductase and/or thioredoxin 
10 which are processed to produce a mature form. Such processing may proceed via 
various intermediate forms. 

Vectors of the present invention preferably comprise a regulatory element ( e.g. 
promoter such as lambda Pl, operator, activator, repressor such as lambda repressor, 

15 particularly a temperature sensitive variant, and/or enhancer), appropriate termination 
sequences initiation sequences and ribosome binding sites. The vector may further 
comprise a selectable marker. Alternatively, regulatory elements (particularly lambda 
repressor) may be located on the host cell chromosome. It is preferred that nrofA and 
nrdB are arranged in the vector downstream (in terms of reading frame) from nrdC In 

20 particular, it is preferred that nrdB Is arranged downstream from nrd/K, Thus a most 
preferred arrangement is a vector comprising an operon comprising nrdCAB. 

The T4 ribonucleotide reductase is not devoid of feedback-control in vivo (J. Ji, R.G. 
Sargent, and C.K. Mathews, J.BioLCIiem, 266: 16289-16292, 1991 ; and Berglund 

25 supra). To promote ribonucleoside diphosphate reduction further e.g. for thymidine 
production, the gene coding for the regulatory subunit, nrdA, may be modified by, for 
example, a mutational approach to create an enzyme capable of increased thymidine 
production due to e.g. a reduced sensitivity to altosteric inhibition for example inhibition 
by the enzyme's immediate product or inhibition by a product resulting from a 

30 downstream event. 

In order to construct T4 nrdA mutants, site-directed mutagenesis may be used to 
modify or change (e.g. substitute) gene bases encoding amino acids suspected to alter 

7 
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e.g. dTTP binding site involved in allosteric regulation. Analysis of the amino acid 
sequence of T4 ribonucleotide reductase revealed a segment that appears to fit well 
with a postulated consensus sequence thought to be involved in dTTP binding (E.M. 
Mcintosh and R.H. Haynes, MoL Cell. Biol. 6: 1711-1721, 1986). Several changes may 
5 be made in this region of the T4 ribonucleotide reductase using oligonucleotide-directed 
mutagenesis. The general approach may be modelled after the effort of More etaL 
(More, J.T., J. M. Ciesia, L.-M. Changchien. G.F. Maley and F. Maley, Biochemistry 33: 
2104-21 12, 1994) to reduce the dTTP binding of deoxycytidylate deaminase. One 
mutation. ^Ala to lie, in the T4 nrdfK appeared to be very useful. For example, the 
10 thymidine productivity of strains containing the ^®Ala to lie mutant in T4 nrdfK evaluated 
by a shake flask fermentation method was significantly increased. As demonstrated 
below, the present inventors achieved at least 25% increase over the parent strain 
without this single change. 

15 Although the ^^Ala to lie is one successful example, those skilled in the art will now 
realize that many other amino acid changes to this region are now possible to obtain 
the desired effect, that being to putatively disrupt dTTP binding, but not disrupt the 
enzyme's basic functionality. For example, substitution of ^®Ala with other amino acids 
displaying similar side chains to lie (e.g. leucine, valine) may be utilized. Modifications 

20 of position 79 in conjunction with other modifications (e.g. mutations) within the 

postulated consensus region are also envisaged. Deletion of one or more amino acid 
positions in the consensus region and introduction of synthetic DNA into the region are 
other approaches available to those skilled in the art. 



25 In another aspect of the present Invention there is provided a host cell comprising a 
construct which construct (e.g. vector) comprises a transcriptional unit comprising DNA 
sequences encoding for heterologous ribonucleotide reductase and thioredoxin which 
reductase is less sensitive to allosteric inhibition than the wild type host cell equivalent 
or counterpart. It will be apparent to those skilled In the art that determining the relative 

30 sensitivity of a candidate heterologous reductase to allosteric inhibition compared to 
the wild type host cell equivalent is a matter of routine experimentation and observation. 
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Transcription units comprising the DNA sequences e.g. nrc/A, nrdB and nrdC genes are 
preferably operons wherein the nrd genes are arranged in tandem. This permits 
transcription of these genes as a single mRNA transcript. In order to minimize 
unproductive energy expenditure by the host cell and further to minimize plasmid size, it 
5 is preferred that the operon contains only genetic sequences required in the encoding 
of reductase and thioredoxin (including any regulatory or control elements). This may 
necessitate the removal of superfluous DNA (for example, the unusual Intron in the 
phage T4 nrdB gene, Sjoberg, B-M., et al EMBO J.5: 2031 -2036, 1 986). 



10 In other preferred embodiments, vectors of the present invention for use in for example 
the production of thymidine further comprise DNA sequences encoding for thymidylate 
synthase ( e.g. the td gene). See e.g. Chu, F. K. ef a/., Proc. NatL Acad. Sci. USA 
81:3049-3053 (1984); Chu, F. K.etai, J, Bacterioi 169:4368-4375 (1987). The 
purpose of using this enzyme is to improve control over the levels of deoxyuridine 

15 produced and in particular the relative impurity level of deoxyuridine relative to 

thymidine. The dTMPase enzyme Is not completely specific for dTMP. With a higher 
K;„than for dTMP, the PBS1 dTMPase will also utilize dUMP as substrate to produce 
deoxyuridine (Price, A. R., Methods in EnzymoL 51 : 285-290, 1978). Deoxyuridine 
creates a significant problem for thymidine purification. Therefore, one way to reduce 

20 deoxyuridine production is to efficiently convert dUMP to dTMP by increasing the level 
or effectiveness of thymidylate synthase such that the internal concentration of dUMP 
always remains very low. 



The thymidylate synthase gene {td) may be heterologous with respect to the host cell 
25 and it is preferred that td is derived from (in the sense defined supra) T bacteriophage, 
e.g. T "even" phage and in particular T4 phage td. Although td may be located in its 
own transcription unit, it is preferred that td is located in the same transcription unit e.g. 
operon as nrd genes. Moreover, it is preferred that td is located in the same operon 
downstream (in terms of reading frame) from the nrd genes, 

30 

McCandliss and Anderson, supra, amplified the E. coli thymidylate synthase gene in 
plasmids pCG138 and pCG148 (see Table 5, of the '972 patent) and it was found to be 
partially effective in reducing deoxyuridine. The T4 thymidylate synthase is much more 

9 



WO 01/02580 PCT/GBOO/02357 

effective which is surprising in light of the fact that the E .coli enzyme is not thought to 
be controlled by any type of allosteric regulation (Neuhard and Kelln, supra). The E. 
coli enzyme for dUMP, 4 \xM (Wahba, A. J. and M. Friedkin, J. Biol Chem. 237: 
3794^3801), and the T4 enzyme for dUMP, 2.73 \xM (Maley, R, L LaPat-Polasko, 

5 V. Frasca and G.F. Maley, Functional domains in T4 Thymidylate Synthase as probed 
by site-directed mutagenesis. Chapter 29 [In] Karam, J.D. led] "Molecular Biology of 
Bacteriophage T4", American society for Microbiology, Washington, DC, 1994, pp 322- 
325), are similar and cannot explain the large difference in effectiveness. Whilst not 
being bound by theory, the inventors believe that the E. ooli thyA has an internal 

10 transcription termination sequence derived from an upstream gene that could be 
effecting the expression level in plasmid clones (Bell-Penderson, D, J.L. Galloway - 
Salvo, and M. Belfort, J. BacterioL 173: 1 193-1200, 1991). 



In other preferred embodiments, host cells of the present invention, particularly for use 
15 in the commercial production of pyrlmidine deoxyribonucleosides e.g. thymidine 

comprise a transcription unit (e.g. operon) which unit comprises ONA sequences e.g. 
udk gene encoding for uridine kinase and preferably DNA sequences e.g. dcd gene 
encoding for dCTP deaminase. See e.g. Wang, L. and B. Weiss, J. Bacterial. 
174:5647-5653 (1992); and Neuhard, J. and L. Tarp0, J. Bacterial. 175: 5742-5743. 

20 

The construct of this aspect of the invention may additionally comprise a transcription 
unit encoding for ribonucleotide reductase (nrdA and nrdB) and the thioredoxin (nrofC), 
or precursor forms thereof which are preferably heterologous with respect to host cell 
DNA and preferably derived from E.coli bacteriophage, particularly T "even" phages e.g. 
25 T2,T4 or T6, 

Uridine kinase produces UMP and CMP from uridine and cytidine using GTP (or dGTP) 
as the phosphate donor. The reaction is inhibited by UTP and CTP (J. Neuhard and 
R.D. Kelln, Biosynthesis and Conversions of Pyrimidines, Chapter 35 [in] F.C. Neidhart 
30 et al. [eds], " Escherichia Coli and Salmonella Cellular and Molecular Biology", Second 
Edition, ASM press, Washington D.C). The present Inventors have found that the use 
of uridine kinase particularly together with dCTP deaminase leads to a marked 
improvement in the production of thymidine by host cells incorporating these changes 
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together with the teachings of *972 outlined supra. This observation is quite 
unexpected since uridine kinase, on the basis of current information, has no direct role 
in pyrimidine de novo biosynthesis, moreover that its use would be beneficial in 
commercial processes for the production of pyrimidine deoxyribonucleosides. It is 
5 preferred that udk and cfccf genes are arranged in tandem in the same operon. Further 
envisaged are genes that encode precursor forms of the udk and dcd gene which are 
processed to produce a mature form. Such processing may proceed via various 
intermediate forms. The udk and dcd genes may be introduced into the construct ( e.g. 
vector) from any suitable source by methods well known to those skilled in the art for 
10 example PI transduction, electroporation or transformation. 



The enzyme uracil DNA glycosylase. encoded by the ung gene, is responsible for 
degrading DNA that has uracil incorporated in place of thymine. Where host cells of the 
present invention are used in the commercial production of e.g. thymidine, the internal 

15 cellular concentration of dTTP may be lowered as a result of the utilization of dTMP ( a 
precursor of dTTP) in the production of thymidine. Accordingly, the present inventors 
have recognized that there is potentially a greater propensity for uracil incorporation 
into the host DNA which may be lethal to a wild type host due to the uracil DNA 
glycosylase activity causing too many single stranded breaks in the host cell DNA. 

20 Thus, host cells useful in the present invention may further display repressed 
(compared to the unmodified cell) or no uracil DNA glycosylase activity. This 
repression or absence may be achieved through various ways apparent to those 
skilled in the art. For example, antagonism ( either total or partial) of the ung gene 
expression products is one such approach by introducing an antagonist of the 

25 functional enzyme (or precursor thereof) into the host cell. Other approaches include 
manipulating ung gene expression by e.g. modifying regulatory elements of ung gene 
expression or introducing mutations into the ung gene itself such that ung gene product 
expression displays little or no uracil DNA glycosylase protein and/or activity. Another 
approach is to delete the ung gene ( or functionally critical parts thereof) from host cell 

30 DNA. The absence or low level of uracil DNA glycosylase activity may be a feature of 
the host cell without the need for further manipulation. 
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In preferred embodiments of the present invention, each of the advances taught herein 
are incorporated into a host cell. The nrd, td, udk and dcd genes may be located on 
separate constructs but it is preferred that they are all located on the same construct 
e.g. vector. Thus in a particularly preferred embodiment of the present invention, a 

5 modified host cell is provided in which the cell comprises a DNA construct (e.g. vector) 
comprising a transcription DNA unit (e.g. operon) which unit comprises DNA sequences 
encoding for (preferably a T even phage, e.g. T4) a modified ribonucleotide reductase 
and thioredoxin in which said reductase preferably displays less sensitivity to allosteric 
inhibition than wild type host cell equivalent or counterpart wherein said construct 

10 further comprises: 



(a) a transcription unit (e.g. operon) encoding for (preferably heterologous with respect 
to host cell equivalent or counterpart) thymidylate synthase and; 

(b) a transcription unit (e.g. operon), encoding for uridine kinase and preferably dCTP 
15 deaminase; 

and in which the host cell displays repressed or absent uracil DNA glycosylase 
activity. 



Host cells modified according to the present invention are particularly useful in the 
20 commercial production of pyrimidine deoxynucleosides. In a particularly advantageous 
use of the present invention, E. coli host cells comprising (harboring) a plasmid 
modified according to the present invention (particularly in conjunction with the 
teachings of the '972 patent) may be used in the commercial production of thymidine. 
Thus, host cells modified according to the present invention may further comprise 
25 dTMPase derived from e.g. PBS1 and the mutations taught in the *972 patent, e.g. 
deoA, tdk-l and udp-1. 



Generally, a fermentation method is employed which involves submerging the cells in a 
culture medium contained within a suitable vessel. Following culturing under 
30 appropriate conditions, produced thymidine is harvested and purified (enriched), if 
necessary, to pharmaceutical grade according to standard protocols. The purified 
thymidine may then be used in the production of medicaments, e.g. pharmaceutical 
compositions such as AZT. 

12 
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Brief description of the drawings 

The present invention is illustrated by way of example only and with reference to the 
5 following figures in which: 

Fig.1 illustrates, schematically, a route for the construction of pCG366. It should be 
noted that the plasmids are not drawn to scale. 

10 Fig.2 illustrates, schematically, a route for the construction of pCG374 and pCG375. 

Fig. 3 illustrates a map for plasmid pCG532. 

Fig.4 illustrates growth and thymidine production by recombinant E. coli strain 
15 CMG2451 (Ung*) and CMG2492(Ung ) hosting a plasmid pCG366 (nrdCAB td) 
according to example 10. 

Fig.5 illustrates thymidine production in a 30-liter fermentor by E. coli CMG2451 
(pCG532). 

20 

Fig. 6 illustrates TdR and UdR (mg/L) obtained according to the purification protocol of 
example 1 2. 

Example 1 

25 Cloning of T4 nrdCAB genes and demonstration of activity 

The bacteriophage T4 nrdAB genes were cloned by performing the polymerase chain 
reaction (PGR) with isolated phage T4 DNA. The primers for PGR nrdA gene were: 
5'-TAT TCT AGA CGA TTT TCA AGT TGA GGA CTT ATG C-3' (Seq. id 1); 
30 and 5*-TAT ATG GAT AAT TCA TTA CAA TTT AGA CGG TGC AG-3' (Seq. id 2). 
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The restriction site Xba\ was the introduced at the beginning of the nrdA in the amplified 
DNA, and C/al was introduced at the 3* end of nrdA. The primers for PGR amplification 
of nrdB gene were: 

5'-TAT ATC GAT AAA TGT AAA TTT AAG GAT TCT AAA TG-3' (Seq. Id 3) and 
5 5'-TAT GTC GAG TCG TTA AAA GTA TTT TTT AAA AGT G-3' (Seq. id 4). 



The restriction site C/al was thus introduced at the beginning of the nrdB, and Apa\ was 
introduced at the end of nrdB In the amplified DNA. The PGR fragments were cloned 
into plasmid vectors as illustrated in Fig. 1 according to techniques known to those 

10 skilled in the art. The cloned nrdAB genes were confirmed by enzyme activity assay. 
The T4 nrdC gene was cloned into pKC30 producing plasmid pDL51 (LeMaster, D.M., 
J. Virology 59: 759-760, 1986 ) and was supplied by D. LeMaster (Dept of Biochem., 
Univ. Wisconsin, Madison, Wl). The gene was sub-cloned into a plasmid with nrdAB 
genes as illustrated in the Figure 1 . The sources of starting materials and background 

15 information used in Figure 1 are listed in Table 1 . 

A synthetic transcriptional terminator was used for the construction of pGG198 and 
pGG301 (see Fig. 1 and Table 1). Specifically, plasmid pBC sk"^ obtained from 
Stratagene (La Jolla, Galifornia) was digested with restriction enzyme Apa I and 
20 Asp7^ 8 I. Synthetic DNA containing the ECOTGP transcription termination sequence 
(d' Aubenton Carafa et ai, J. MoL Biol. 216: 839-843, 1990) was then ligated replacing 
the original sequence 
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Table 1: Genealogy of Plasmid pCG366 and pCG532 and Materials Source 



Plasm id 


Gene($) 


Pro- 
moter ; 


Vector 
origin 


Marker 


Derivation 


PACYC177 


amp kan 




p15A 


amp kan 


Chang and Cohen (1978) J. 
Bacteriot. 134:1141-11 56. ATCC 
37031 


pBC sk+ 


Cam 


P/ac 


CotE\ 


Cam 


Stratagene.l 101 1 North Torrey 
Pines Road, La Jolla, CA 92037. 


pBluescript 
Wks 


Amp 


P/ac 


coiE1 


Amp 


Stratagene,1 101 1 North Torrey 
Pines Road, La Jolla, CA 92037. 
ATCC 87047 


pBR322 


amp tet 




co^E^ 


amp tet 


Bolivar et aL (1977) Gene 2:95-113. 


pDL51 


T4 nrdC 




co/El 


Amp 


LeMaster, J, ViroL 59: 759-760. 
1986 


pKC30 


Amp 




CO/E1 


Amp 


Shimatake, and Rosenberg. (1981) 
Nature 292:128. ATCC 37286 


PKTdAl 


JAtd 




cofEi 


Amp 


West et al. (1 986) J.B.C. 
261:13446-13450. 


pT7/T3/18 


Amp 


phage 
T7/T3 


colEl 


Amp 


Lile Technologies, INC.,9800 
Medical Center Dr., Rocl<ville, MD 
20897. 


r%X7^ Ql 1 

p 1 Z.1 oU 


Amp 




COIci 


Amp 


Rin.RaH 9nnn Alfrori Mnhol Hr 

Hercules, CA 94547. 


pUUio 


Amp 


Kiac 


COfci 


Amp 


T aniSCi 1 r CI 1 on t7l cti. ^ 1 909^ y^VifV 

33:103-119. ATCC 37253 


pUCl9 


Amp 


Plac 


co^E^ 


Amp 


Yanisch-Perron et al- (1985) Gene 
33:103-119. ATCC 37254 


pCGl73 


XPi_ vector 


XP. 


colE\ 


Amp 


XPl in pBluescript II ks 


pCG196 


T4 nrdC 




colE^ 


Amp 


T4n/x/CinpUC19 


pCG198 


Cam 


Plac 


colE\ 


Cam 


Synthetic transcription terminator in 
pBC s/c+ 


pCG301 


XPl vector 


XP^ 


CO/E1 


Cam 


XP\^ \n pCG198 


pCG308 


T4 nrdC 


XP, 


CO/E1 


Cam 


T4 nrdC in pCG301 , LeMaster, J. 
Virol. 59: 759-7o0, i9oo 


pv^oo 1 ^ 


1 H nrur\ 


PT7 


Cut JO. \ 


__ 

Amp 


T4 nrHA in nT7/T3/1 ft 


p\^VJIO 1 o 


/ Vr CJD 


PT7 






T4 nrcfB in nBluscriDt II ks 


DCG323 


NrdAC 




CO/E1 


Cam 


74 nrd AO in pCG301 


pCG326 


NrdABC 




CO/E1 


Cam 


T4 nrdABC in pCG301 


pCG332 


NrdAB 


^Pl 


CO/E1 


Cam 


74 nrdAB In pCG301 


pCG334 


NrdAB 


^Pl 


CO/E1 


Cam 


74 nrdAB in pCG301 


pCG337 


NrdCAB 


^Pl 


co/El 


Cam 


T4 nrdOAB in pCG301 


pCG343 


NrdCAB 




CO/E1 


Cam 


74 nrdCAB in pCG301 


pCG356 


Td 


?iPl 


CO/E1 


Cam 


T4 td in pCG301 


pCG358 


dcd, udk 




CO/E1 

(pUCIB 

) 


Amp 


E. CO// dcd, ud/r in pUCl8 


pCG360 


NrdCAB. td 


XPl 


CO/E1 


Cam 


T4 nrdCAB td in pCG301 


pCG366 


NrdCAB, td 


^Pl 


CO/E1 

(pBR32 

2) 


Cam Tet 


T4 nrdCAB, fd in pBR322, Amp^ 


pCG374 


dcd, udk 




p15A 


amp 


E. CO// dcd. udk in pACYC177 


pCG376 


XP^ vector 


^PL 


p15A 


amp 


pACYCI 77 with XPl & MCS of 
pCG301 


pCG464 


r4nrdA 

Xbal/Hindin 

fragment 




(pTZlS 
U) 

co/El 


Amp 


Xbal/Hm<Ml\ fragment with nrdA 
sequence cloned into^al/ffrndlll sites 
of pTZlSU (DioRad Laboratories) 
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14 nraA jwait 

Hindin 

Fragment 




(pi Zlo 
U) 

colEl 


Amp 


i^CKwH/f Willi iJlUiallv/Ii in If nrurx 

sequence causing '^^ala tie alteration 


pCG494 


NrdCAB td 




colEl 

(pBR32 

2) 


Cam 
Tet 


KpnllAflW fragment from pCG494 
inserted into pCG366 adding '^^ala -> 
ile mutation to nrd\ 


pCG532 


NrdCAB, td, 
udk, Dcd 




co/EI 

(pBR32 

2) 


Cam 


udk,dcd cloned into pCG494 



between Apa I and Asp7^ 8 I endonuclease sites. This fragment recreated the Apa I 
recognition sequence, but destroyed the Asp 71 8 I recognition sequence in the new 
plasmid . The inserted DNA had the following composition 
5 5* -CGAGC CCGCCTAATG AGGGGGCTTT TTT TT - 3' 

3'-CCGGGGTGG GGCGGATTAC TCGGGGGAAA AAAAACATG - 5' 
(shown as respective strands in seq. id 5 and 6) 
produced from two oligonucleotides. 



10 Plasmid pCG198 was combined with pCG173 that contains the lambda Pl promoter 
from plasmid pKC30 cloned into pBluescript II ks (Stratagene, La Jolla, California) as 
shown in Figure 1. Both pCB198 and pCG173 were digested with Hind III andAsn I 
then ligated to create new plasmid pCG301 containing the lambda Pl promoter, multiple 
restriction enzyme cloning sites, and followed by the ECOTGP terminator sequence 

15 copied from the tryptophan operon leader peptide region. 



T4 ribonucleotide reductase activity was measured by HPLC by a method that does not 
involve the use of radioisotopes and UDP substrate. The direct ribonucleotide 
reductase assay contained 1 mM NADPH, 1 mM DTT. 0.5 m/W dATP, 0.6 m/lf UDP, 20 

20 mM Tris (pH 8.0) and 5 mM MgClg. Under these conditions the E. coli ribonucleotide 
reductase is inhibited and is not detected. The enzyme reaction (100 ixL) was stopped 
by the addition of 10 jjL of 50% trichloroacetic acid (TCA). After 10 minutes on ice, the 
samples are centrifuged in a microcentrifuge. The supernatant was extracted 4 times 
with diethylether to remove the TCA. Five ml of Tris buffer (1 .0 M pH 8.0) was added 

25 followed by 2 |iL of 40 mg/mL rattle snake venom (Sigma), and the sample was 

incubated at 37*'C for 60 minutes. The samples were then heated for 3 minutes at 70°C 
followed by centrlfugation for 5 minutes to remove precipitate. The volumes are 
equalized, then analysed by HPLC with an UV detector and deoxyuridine as the 

16 
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Standard. The column is a Spherisorb ODS-2, 5 micron, 250 mm X 4.6 mm using a 12 
rr\M ammonium phosphate (pH 5.0) mobile phase and a flow rate of approximately 1 .0 
mL/minute. Results are shown in Table 2 for cells containing plasmid pCG343 
demonstrating functional expression of T4 ribonucleotide reductase. 

5 

Table 2: Ribonucleotide Reductase Activity 



strain 


Induction condition 


Specific Acitivity 
nmol/10 minVrng protein 


CMG1093 


Uninduced 


0 


CMG1093 


Induced 


0 


CMG1093/pCG343 


Uninduced 


0 


CMGl093/pCG343 


Induced 


704.7 



EXAMPLE 2 

10 Derivation of Host Strain CMG2451 from Strain CMG1115 

Strain CMG1 1 15 is fully described in McCandliss & Anderson (U. S. Patent 5,213. 972). 
CMG1 115 was the starting point for development described herein. Strain CMG1 115 
was improved for thymidine productivity by selection for growth on medium containing 

15 30 mg/L of 5-fluorouridine that yielded strain CMG2401. Strain CMG2401 was then 
selected for growth on medium containing 30 mg/l of 3*-azido-3'-deoxythymidine which 
yielded strain CMG2404. CMG2404 requires L-proline for growth due to the inherited 
mutation A{laC'pro) from its original parent JM101 . Hfr mating between CMG2404 and a 
Hfr stain GAG5053 ( Singer, M. et al. Microbiological Review: 5:1-24,1989) was 

20 performed according to techniques known to those skilled in the art and yielded strain 
CMG2434 which is Lac*, Pro*. The udp (uridine phosphoryiase) mutation in CMG2434 
still had partial uridine phosphoryiase activity that was evident based on thymine 
accumulation after induction of thymidine production. The udp mutation was 
reintroduced from CGSC5128 {E. coli Genetic Stock Center, Yale University) by phage 

25 PI transduction according to techniques known to those skilled in the art. The 

mefE3079::Tn70 from strain CAG18491 was first transduced into CMG2434 to serve as 
a positive selection marker for transduction of udp. Then the udp-1 was transduced 
into the n7efE3079::TntO derivative of CMG2434 by selecting for growth without L- 
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methionine in the defined medium. The t/dp-1 derivative was named strain CMG2451 . 
The genealogy of CMG2451 is summarized in Table 3. 

Table 3: Genealogy of E. coli Host Strain CMG2451 

5 



strain 


_ ^ .Genotype , . 




CMG1115 


CMG1 106 (Tn5::dTMPase kan") 


Tn5::dTMPase insertion from pGGl32. 


CMG2401 


CMGl115FUdR" 


5-fluoro-2'deoxyuridlne resistance. 


CI\/IG2404 


CMG2401 AZT« 


3'-Azido-3'deoxythymidine resistance. 


CMG2434 


CMG2404 Lac* Pro* 


Repair A{/ac-proAB) by conjugation with 
CAG5053^ 


CMG2448 


CMG2434 mefE3079::TntO 


metE3079::Tn10 from CAG18491^ 


CMG2451 


GMG2448 L/c/p-1 metE* tef 


udp-1 from CGSC5128 ^ and replaced 
metE3079::Tn10. 



*• Mutations were at times introduced into E. coli strains by phage P1 transduction. If the mutation 
has a selective marker, direct P1 transduction was used. If the mutation has no selective marker, P1 
cotransduction with the nearby JnIO insertion was used. 
10 Singer, M., et al. Microbiologicaf Review: 53: 1 -24, 1 989. 

All CGSC strains can be obtained for E. coli Genetic Stock Center. Yale University, P.O. Box 
208104, New Haven, CT 06520-8104. 

EXAMPLE 3 

15 Cloning and expression of T4 td gene Into thymidine production plasmid 

The T4 td gene was cloned into pKT dAI by West et al. ( J. Biol. Chem 261 :1 3446- 
13450, 1986) without the1017-base pair intron. The td gene was sub-cloned into 
pCG301 using two oligonucleotides as linkers with the following sequences: 
20 5*-GAT CCG GAG GAT AAA TGA AAC AAT ACC AAG ATT TAA T-3' (seq. id 7) and; 
5'-TAA ATC TTG GTA TTG TTT CAT TTA TCC TCC G-3' (seq.id 8). 

The result plasmid was pCG356. The td gene from pCG356 was then sub-cloned into 
pCG343 to create pCG360 (Fig. 1). The tetracycline resistance gene and plasmid 
25 replication origin from pBR322 (Bolivar, F. et al.. Gene 2: 95-1 13, 1977) was sub- 
cloned into pCG360 and formed pCG366. The thymidylate synthase activity was 
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measured by spectrophotometric method of Wahba and Friedkin (J. Biol.Chem. 236: 
PC1 1 -PCI 2, 1 961 ). The results are shown in Table 4. 



Table 4: Thvmidviate Synthase Activity 

5 



strain 


Specific Activity 
AOD340/mg proteiiVmin 


CMG1093 


-0.72 


CMG1093/pKTdDI 


3.24 


CMGl093/pCG356 


3.45 


CMG1093/pCG360 


1.43 



EXAMPLE 4 

Shake flask data showing the value of T4 td gene on thymidine production and 
deoxyuridlne reduction 

10 

The shake flask fermentation was used to evaluate the thymidine productivity of 
different E. coli recombinants. The shake flask fermentation broth and methods used 
here, and in other examples, is described in Example 6 below. In this case 20 mL 
volume per flask was inoculated with 2 ml of seed culture and incubated in a 30°C 

15 shaker. At about 1 0 OD 600nm, the flasks were transferred into a SJ^'C shaker for 30 
minutes to mildly induce the X promoter. Then the flasks were transferred into a 35°C 
shaker to continue the fermentation. Glucose was fed during fermentation as needed, 
and pH was adjusted to about 7 with ammonia according to the color of phenol red. The 
thymidine concentration was measured by HPLC using a Spherisorb ODS-2, 5 micron, 

20 250-mm X 4.6 mm column, and a 25 mM ammonium phosphate (pH 3.3) mobile phase 
with a flow rate of approximately 1 .5 ml/minute. 



The strain CMG2451/pCG366 (T4 nrdCAB, tcfj was compared with CMQ2451/pCG343 
(T4 nrdCAB) in shake flask fermentation. Table 5 shows that deoxyuridlne 
25 concentration was reduced, and converted to thymidine in strain CMG2451/pCG366 
due to the T4 td gene. 
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Table 5: Thymidine Production-Effect of Thymidvlate Synthase at 66 hours . 



stain 


Thymidine (mg/l) 


Deoxyuridine (mg/l) 


% Deoxyuridine 


CMG2451/PCG343 


1198 


1728 


144.2 


CMG2451/pCG366 


3327 


206 


6.2 



EXAMPLE 5 

5 

Construction of the ^Ala to He T4 ribonucleotide reductase mutant 

The mutation was Introduced by usfng site-directed mutagenesis based on a method 
described by Kunkel (Kunkel, T.A., Proc. NatL Acad ScL ,USA, 82: 488-492,1985). All 

10 materials for mutant construction including pTZ18U phagemid DNA, M13K07 helper 
phage, bacterial strains E. coli CJ236 and MV1 190, T7 DNA polymerase and T4 DNA 
ligase were provided in the Muta-Gene in vitro mutagenesis kit from Bio-Rad 
Laboratories (Hercules, CA). At first, theXbal/H/ndlll DNA fragment containing the nrdA 
gene of T4 bacteriophage was isolated from plasmid pCG312 (ChemQen Corp., Table 

15 1 above). The Xbal/Hind\\\ DNA fragment was cloned into the Xba\IHin6\\\ sites of the 
pTZ18U phagemid vector using standard protocols (Sambrook, J., Fritsch, E.F. and 
Maniatis, T., Molecular Cloning, Cold Spring Harbor Laboratory, New York ,1989). The 
Insert-carrying phagemid pCG464 was Introduced into E. coli GJ236. This strain is 
deficient for dUTPase (dut) and uracil-N-glycosylase {ung) which results in an 

20 occasional substitution of uracil for thymine in newly synthesized DNA, Single stranded 
DNA of pCG464 containing uracil was isolated from CJ236 according to Bio-Rad 
Laboratories Instructional Manual. This DNA (0.2 pMoie) was annealed with 6 pMoie of 
phosphorylated primer 

5'-AGC AAA CAT TAA ACA GCG TGC AATTAC ATA TTG ATA ATC AGG TTC-3' 
25 (sequence id 9) containing the sequence of the desired mutation (underlined) coding for 
lie instead of original ^®Ala. 

Complementary strand DNA was synthesized by using T7 DNA polymerase as 
described in the Bio-Rad protocol. The reaction products were transformed into E. coli 
30 MV1 190 containing a wild type uracil-N-glycosylase, which degrades the uracil- 

containing parental strand, thus enriching for the mutant strand. Direct DNA sequencing 

20 
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using the Silver Sequence DNA Sequencing System from Promega Corp. (Madison, 
Wl) identified plasmids containing the desired mutation. It appeared that all four 
analysed transformants contained plasmids with the mutation. One of them was 
designated pCG492 and used for further experiments. To check if the mutation affects 

5 thymidine synthesis It was introduced into production plasmid pCG366 (ChemGen 
Corp., Table 1 and Figure 1). For this, the Kpn\/Afl\\ DNA fragment of pCG366 
containing 5'-part of the nrdA gene was replaced with Kpnl/AffU fragment from pCG492 
that contains the mutation. The new plasmid pCG494 was introduced into production 
strain CMG2451 (ChemGen Corp., Example 2 and Table 4 above). The effect of T4 

10 nrdA mutation was evaluated by comparison of thymidine production by CMG2451 
(pCG494) and CMG2451 (pCG366) as shown in Example 6 below. 



EXAMPLE 6 

Shake flask fermentation for Thymidine production using CMG2451 (PCG366) and 
15 CMG2451 fpCG494) 



The 250-m/- baffle flasks containing 25 mL of production medium were inoculated with 
2 mL of a freshly grown seed culture in LB broth with appropriate antibiotic added- The 
cultures were grown in a 30°C shaker at 250 rpm. When the ODgoo reached about 5, the 

20 flasks were transferred into a 37°C shaker for 30 min. Then the flasks were transferred 
into a 35^C shaker to continue the fermentation. The production medium has the 
following composition (g/L): Ardamine YEP-S (Red Star Yeast & Products, Milwaukee, 
Wl) - 10; CaCOa -10; MgS04 - 0.4; phenol red - 0.24; PP90BT (DMV International, 
Fraser, N.Y.) - 4.5; sorbitol - 20; chloramphenicol - 0.03; trace elements (1000X) - 

25 ImU/L. The trace elements (1000X) formulation is the following (g/L): boric acid - 0.05; 
calcium chloride - 20; cobalt sulfate - 0.05; copper sulfate - 0.01 ; ferrous sulfate - 20; 
ferric chloride - 20; manganese sulfate - 0.5; sodium molybdenate - 0.1 ; and zinc 
sulfate - 0.1 . At the time of induction 10 grams per liter of Ardamine YEP-S was added. 
Glucose was fed during the fermentation on an average of every two hours (2.5 g/L). 

30 The pH in the flasks was maintained at approximately 7.0 through the addition of 4N 
NH4OH as judged by the color of the phenol red indicator dye. The ODeoo was read after 
sample dilution 1 :10 into 10 mM HgSO^ to dissolve salts in the medium. Thymidine 
concentration was measured by reverse phase C-18 HPLC with an Alltech Spherisorb 
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ODS-2 column and Shimadzu spectrophotometric detector at 260 nm. The mobile 
phase was a 25 mM NH4H2PO4 (pH 3.3) in water at the constant rate of 1 .5 mL/nnin. 



The results of thymidine production by CMG2451 (pCG366) and CMG2451 (pCG494) 
5 after 2, 1 7 and 25 hours after induction are presented in Table 6. There were two 
repeats of the flasks in this experiment and variability between duplicate flasks did not 
exceed 15%. The results show that the T4 nrdA mutant performed better than the wild 
type nrdA strain. This was confirmed in several Independent shake flask experiments 
with the same bacterial strains. 

Table 6: Thymidine production by CMG2451 (pCG366) 
and CMG2451 (pCG494) 



strain 


2 hours 1 7 hours 


25 hours 


15 


Specific activity 


(mg/L/OD) 


CMG2451 (pCG366) 


6.1 32.5 


43.1 


CMG2452 (pCG494) 


8.5 36.1 


51 .4 



20 EXAMPLE 7 

Shake Flask Fermentation to Demonstrate the Effect of the £. coli udk Gene on 
Thymidine Production, 



The dcd gene or dcd udk operon were cloned into the pACYC177 vector. This vector 
25 with the p15a origin of replication is different from C0/EI based plasmids such as 
pCG366 and thus is compatible and can be maintained in the same host with co/EI 
based plasmids. The details of the plasmid constructions resulting in pCG374 {udk 
dcd) or pCG375 {dcd) are shown in Rgure 2. The genes on these plasmids are 
expressed from the native E. coli promoter of the udk dcd operon. 

30 

Using selection for ampicillin resistance, plasmids pCG374 and pCG375 were 
introduced into CMG2451 (pCG366) to test the effect on thymidine production in the 
shake flask fermentation method described in Example 6. The results at several time 

points are shown in Table 7. Although the specific activity (thymidine per OD of cells) is 

22 
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similar both with and without the udk gene, the cells with the udk gene on the second 
plasmid pCG374 grew to a higher cell density and produced significantly more 
thymidine (5.8 g/L compared to 3.0 g/L for the strain with the dcd only second plasmid 
). This result was not anticipated, as it is not clear why uridine phosphorylase could 
5 have this effect. 

Based on this data and other information, the udk gene was chosen for introduction 
along with the E. coli dcd gene In the construction of plasmid pCG532 (see Example 8 
and Figure 3). 

0 

Table 7: Comparison of the effect of second plasmids with dcd or dcd udk on 
thymidine production in the Ciy/IG2451 (pCG366) baclcground. 



Time 
(hour) 


O.D.600 


Thynn 
(mi 


Udine 
7/1) 


Specific Thymidine 
(mg/l/OD) 


Base 
Strain 
And 
Plasmid 


CMG 
241 
(pCG 
366) 


CMG 

2451 
(pCG 
366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


CMG2451 
(pCG366) 


Second 
pACYC 

177 
based 
plasmid 


PCG 

374 
with 
dcd 
udk 


pCG375 
with 
dcd 


pCG374 

with 
dcd udk 


pCG375 
with 
dcd 


pCG374 

with 
dcd udk 


PCG375 
with 
dcd 


8hr 


29.112 


29.94 


889 


884 


30.5 


29.5 


18 hr 


40.206 


32.946 


1721 


1608 


42.8 


48.8 


42 hr 


48.348 


33.96 


4020 


2740 


83.1 


80.7 


66 hr 


57.498 


28.614 


5804 


3012 


100.9 


105.3 



15 EXAMPLE 8 

Cloning of the £, coli udk dcd operon into production plasmid pCG494 , 



The udk and the dcd genes of E. coli encode pyrimidine ribonucleotide kinase and 
dCTP deaminase, respectively. Both genes were mapped to a 3.4 kb BamH\/Pst\ DNA 
20 fragment of lambda phage 355 of the Kohara genomic library (Kohara, Y., Akiyama, K. 
and Isono, K., Cell 50, 495-508, 1 987). It appears that udk is located upstream of dcd 
and transcribed in the same direction as dcd (Neuhard, J. and Tarpo, L, J. Bacteriol. 
175: 5742-5743, 1993). The genes were cloned into production plasmid by two steps. 

23 
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At first, a 3.4 kb BamH\/Pst\ DNA fragment from lambda 355 was cloned into the 
multiple cloning site of plasmid pUC18 (Yamisch-Perron, C, Vieira, J. and Messing, J., 
Gene 33: 103-109,1985) (plasmid pCG358). Then, the fragment was excised from the 
polylinker region of pCG358 with BamH] and Sph\ and cloned in place of a 0.7 kb 

5 Bgl\\ISph\ fragment (containing a portion of the tetracycline resistance gene) of 
production plasmid pCG494. The final 14.7 kb plasmid pCG532 contains co/EI 
compatibility group origin of replication, the chloramphenicol resistance gene, the udk 
and the dcd genes and the T4 bacteriophage nrdCAB (encode thioredoxin and two 
subunlts of ribonucleotide reductase, respectively) and the T4 td (encodes thymidylate 

10 synthase) genes under control of the Pl promoter of bacteriophage lambda. The T4 
nrdA gene of pCG532 was previously changed by site-directed mutagenesis C^A\a to 
He). 



A synthetic transcriptional terminator is located downstream of the td gene to prevent 
15 transcriptional readthrough into the replication region. The genetic map of plasmid 
pCG532 is illustrated in Figure 3. 

EXAMPLE 9 

Shake flask fermentation of thymidine bv CMG2451 (pCG494) and 
20 CMG2451(pCG532> 



Plasmid pGG532 containing the udk and the dcd genes of E. coli was introduced into 
production strain CMG2451 . New strain CMG2451 (pCG532) was tested together with 
parent strain CMG2451 (pCG494) in shake flask experiments to compare thymidine 
25 production. The results of two independent experiments are shown in Table 8. The first 
experiment was performed as described above and samples were taken at 1 7 hours 
after Induction. In the second experiment cells were induced at higher OD (about 9) and 
samples were taken at 3 hours after induction for analysis. In both cases the strain 
containing the cloned udk and dcd genes performed better than the parent strain. 
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Table 8: Thymidine fermentation bv thvmidin bv CMG2451 fpCG494> and 
CiyflG2451 rpCG532) 



cxpcrirneni i 








Strain 


O.D 600 


TdR (mg/L) 


Specific Activity(mg/L70D) 


CMG2451 (pCG494) 


16.5 


599 


36.3 


CMG2451 (pCG532) 


17.5 


867 


49.5 










Experiment 2 








Strain 








CMG2451 (pCG494) 


8.5 


149 


17.5 


CMG2451 (pCG532) 


8.4 


192 


22.8 



5 Example 10: Addition of unq mutation and its effect on thymidine production in 
Shal^e flasic fermentation 



An uracil DNA glycosylase negative strain was constructed by introducing an ung::Tn10 
(Varshney, U., et a!., J. Biol. Chem. 263:7776-7784. 1988) mutation into host CMG2451 

10 using P1 transduction as described above, and was named CMG2492. Plasmid 

pCG366 was introduced into CMG2492. A comparison experiment between Ung' and 
Ung"^ strain for thymidine synthesis in shake flasks is shown in Figure 4 using the flask 
method described in Example 6. Cells were grown in 250ml flasks at 30°C, and 
thymidine synthesis was Induced by shifting temperature to 37''C for 30 min, then, 

15 shifting to 35''C. The Ung' host without uracil DNA glycosylase kept growing longer and 
made 30% more thymidine. 

EXAltflPLEU 

Thymidine Production in a 30 Liter Fermentor with strain CMG2451 {pCG532) 

20 

The following conditions were used to produce thymidine in a 30-Liter fermentor (B. 
Braun Biotec Biostat C) with strain CMG2451/532. The seed culture (500 mL)was 
grown in a 4 Liter baffle shake flask in LB medium (5 g/L Difco yeast extract, 10 g/L 
Difco tryptone, 5 g/L NaCI) with 30 mg/L chloramphenicol and 25 mg/L kanamycin at 30 
25 C until 2.37 OD 600 nm was reached with a final pH of 6.69. 
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The initial batch in the fermentor (12 Liters) containing the composition listed in Table 9 
was sterilized at 121° C for 55 minutes. After cooling a separately autoclaved solution 
(500 mL) was added to adjust the batch to 20 g/L sorbitol and 3.0 g/L MgS04 7 HjO. 
Also added before inoculation was a sterile filtered solution (200 mL) designed to to 
5 adjust the initial batch to 30 mg/L chloramphenicol, 25 mg/L kanamycin, 1 mg/L d- 
biotin, 10 mg/L thiamine and 10 mg/L nicotinic acid. 



Three feed solutions were prepared: a) Cerelose 2001 (dextrose monohydrate) 562 g/L 
with 2 mg/L biotin, 20 mg/L thiamine, 20 mg/L nicotinic acid, and 30 mg/L 

10 chloramphenicol; b) sorbitol 717 g/L with with 4 mg/L biotin, 40 mg/L thiamine, 40 mg/L 
nicotinic acid, and 60 mg/L chloramphenicol; and c) crude nitrogen mixture containing 
360 g/L Amberex 695 AG (Red Star Yeast & Products, Milwaukee, Wise), 6 g/L PP90M 
(DMV International, Fraser, NY) with IX trace elements and 0.1 mUL Mazu 
DPI 0PMOD1 1 (BASF). The feed solutions were sterilized for 40 to 50 minutes underl 8 

15 RSI steam pressure. 



Table 9 : Initial batch composition in 30 L fermentor 





Concentration 


Component 


(g/L orxnUL) 


Sodium hexametaphosphate 


4 


KH^PO^ 


4 


(NH,)2HP04 


4 


CaClj 


0.4 


Citric acid 


0.5 


Amberex 695 (Red Star Yeast & Products, 


20 


Milwaukee, Wl) 




PP90M (DMV International, Fraser, NY) 


15 


Tryptone (Difco, Detroit, Ml) 


5 


1000X Trace elements (see Example 7) 


1 mUL 


CaCOa 


5 


Glycine 


1.83 


Mazu DF10PMOD11 Defoamer (BASF 
Speciality Products, Gurnee, IL) 


0.2 mUL 
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The operating conditions were as follows: initial temperature 31 ""C; RPM 600; air flow 3 
LPM; pH 6.8; pressure 0 Bar. The dissolved oxygen was controlled at 25% saturation 

26 



wo 01/02580 PCT/GBOO/02357 

using air flow rate control loop. The RPM was increased to 750 RPM at 8 hours, 850 
RPM at 9 hours and 950 RPM at the time of temperature shift from 31 to 35.5''C at 9.2 
hours when the culture reached 37.8 OD. Beginning at 9.7 hours back pressure was 
applied up to a maximum of 0.6 Bar to aid in oxygen transfer into the culture. The rate 
5 of temperature shift for induction of thymidine synthesis was 0.2^C per minute. 

After the cell mass reached 20 OD at 600 nm (read after dilution in 50 mM H2S04to 
dissolve salts), 85 mL batch feeds of the sorbitol feed solution (b) were made for each 5 
OD Increase in cell mass. At 16.7 hours sorbitol batch feeding was stopped and 

10 dextrose monohydrate (glucose) feed (a) was started under the control of the DO PID 
control loop of the B. Braun fermentor with a set point of 25%. Simply stated, when 
dissolved oxygen was below 25% sugar feed was off, and when DO was greater than 
25% the sugar feed was set to on. The protocol self regulates the glucose 
concentration keeping the concentration low, but does not allow the culture to starve for 

15 glucose for a very prolonged length of time. Crude nitrogen feeds (500 mL) were made 
at 11 hours, 16.2 hours, 21.2 hours, 28.2 hours. 33.2 hours and 40.5 hours. 

The cell mass, thymidine and deoxyuridine accumulation during the fermentation are 
shown in Figure 5. 

20 

EXAMPLE 12 
Purification of Thymidine from Fermentation broth, 

Dowex Optipore L-285 (The Dow Chemical Company) was suspended in deionized 
25 water and packed into a 48mM diameter glass column making a bed volume of about 
500 mL. The column was washed with 500 mL of 5% NaOH, then washed with 
deionized water to until the effluent was pH 7.0. 

500 mL Fermentation broth with thymidine (TdR) concentration of 4.890 g/L and 
30 deoxyuridine (UdR) concentration of 1 .040 g/L was loaded onto the column. The 
column was washed with two bed volumes of deionized water. TdR and UdR were 
eluted by two bed volumes of 5% reagent alcohol (Ethanol 90.5%, Methanol 4,5%, 
Isopropyl alcohol 5%) followed by two bed volumes of 10% reagent alcohol and two 
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bed volumes of 15% reagent alcohol. The TdR and UdR in the 25 mL fractions is 
shown in Fig. 6. The column was regenerated by washing with 5% NaOH, then with 
deionized water to pH7.0 and the procedure was repeated. The fractions 91-160 were 
pooled together from the two separate runs and dried using a rotary vacuum 
evaporator. Thymidine was re-dissolved in a minimal amount of hot water and 
crystallized at 4''C. Then the crystals were re-crystallized two times and dried in a 55°C 
oven for 1 5 hours. A total of 979.8 mg of crystalline thymidine was obtained with a 
purity of greater than 99% that should be suitable for use as a pharmaceutical 
intermediate. 

The side fractions containing both deoxyuridine and thymidine were pooled with mother 
liquors and reduced by a rotary vacuum evaporator to a final volume of 1200 mL with 
reduced alcohol. The total amount of TdR in this 1200 mL solution was 3571 mg. The 
total recovery (Including the 3571 mg of TdR side fractions) was 93.1%. 
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